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DETAILED ACTION 

1 . Claims 1-14,16-32, 34-47, and 49-5 1 have been examined. 

Papers Submitted 

2. It is hereby acknowledged that the following papers have been received and placed of 
record in the file: Amendment as received on 1 1/21/2007. 

Claim Objections 

3. Claim 13 is objected to because of the following informalities: In the paragraph labeled 
"(iii)", line 5, replace "sets data" with -set of data--, as this is how the claim will be interpreted. 
Appropriate correction is required. 

4. Claim 16 is objected to because of the following informalities: In the paragraph labeled 
"(ii)", line 4, replace "sets data" with —set of data--, as this is how the claim will be interpreted. 
Appropriate correction is required. 

Claim Rejections - 35 USC § 103 

5. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

6. Claims 1, 6-10, 12-14, and 51 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Inagami et al., U.S. Patent No. 4,881,168 (as applied in the previous Office Action and 
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herein referred to as Inagami), in view of Luk et al., U.S. Patent No. 5,883,814 (herein referred 
to as Luk), and further in view of Farmwald et al., U.S. Patent No. 5,243,703 (herein referred to 
as Farmwald). In addition, Garcia, U.S. Patent No. 4,827,476, is cited as extrinsic evidence 
which shows that DRAM is inaccessible during refresh cycles. 

7. Referring to claim 1, Inagami has taught a processor comprising: 

a) an array comprising a plurality of random access memory cells arranged in rows and columns. 
See Fig.l and note that main memory (component 1) inherently comprises rows and columns of 
memory cells. Inagami has not explicitly taught that the memory array is an embedded DRAM 
array (embedded on the same chip as the processor). However, Official Notice is taken that 
DRAM and its advantages are well known and accepted in the art. More specifically, DRAM is 
a very popular memory technology because of its high density (due to structural simplicity) and 
low price (in comparison to other memory such as SRAM). Furthermore, Luk has shown that a 
processor and DRAM can be embedded on a single chip. See Fig. 2. Luk has further taught, in 
column 2, lines 1-14, that such an integration allows for the elimination of off-chip drivers and 
heavy capacitive loads, low power dissipation, and less contmiunication time between the two 
components. Consequently, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to first modify Inagami's main storage to be DRAM, for the advantages 
described above, and to then embed the DRAM array on the same chip as the processor. 

b) a row address register that holds a pointer that points to a row of the DRAM array. See Fig. 1 , 
component 60, and column 5, lines 13-17. 

c) one or more sets of registers, each of said sets of registers capable of loading or storing an 
entire row of the DRAM array in response to a single latch signal. See Fig.l, Fig.4, Fig.5, and 



Application/Control Number: 10/074,779 Page 4 

Art Unit: 2183 

column 1, lines 52-64, and column 6, lines 48-53. Note that each 4-register set is capable of 
loading/storing an entire row of the DRAM array in response to a single load instruction (latch 
signal). Note that a single DRAM array row may correspond to a single entry shown in storage 
21 in Fig.4. For instance, data aO is in row 0, data al is in row 1, etc. Or, in an alternate 
interpretation, a single DRAM row may correspond to four consecutive entries shown in storage 
21 in Fig.4. A row is merely a number of items arranged in a line. Hence, any number of data 
items in storage 21 can be called a "row", 
d) an instruction set which includes: 

(i) at least one command to perform an arithmetic operation on said row address register. 
See Fig.6, column 7, line 66, to column 8, line 9, and note that arithmetic is performed on 
the row address register by the start address calculation unit 310. 

(ii) a command to load a plurality of words of a precharged row into designated sets of 
data registers. See Fig.4, and column 1, lines 52-59. Note also that in DRAM, the row 
must be precharged before it is read, as this is how DRAM technology functions. 

(iii) a command to load selected columns of a precharged row pointed to by said row 
address register into designated sets of data registers, said selection based on bits in a 
mask, wherein all the selected columns of the precharged row are loaded into the 
designated sets of data registers in a single operation. Again, see Fig.4, the abstract, 
column 1, lines 52-59, and column 6, line 37, to column 7, line 8. Note that in a first 
interpretation, aO-a3 columns are loaded into the designated register set in response a 
single load instruction. And, in an alternate interpretation, each of the sets is loaded 
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based on a mask 22 in response to a single parallel load operation, where the parallel load 
includes individual load instructions. 

(iv) Inagami in view of Luk has not explicitly taught a command to precharge (activate) 
the row pointed to by said row address register. However, it is known that precharging of 
DRAM must occur. Farmwald has taught an instruction which explicitly precharges a 
row. See column 7, lines 49-55, and note that a bus transaction instruction is issued. It 
includes the fields shown in Fig.4. When bit 2 of AccessType is set to the appropriate 
value or when the 3^** bit of AccessType is set to the appropriate value, a row precharge 
will occur. See column 12, lines 34-53, and column 11, line 25-60. Note that such a 
command allows for precharging ahead of time so that when an actual load occurs, for 
instance, the load can begin immediately, thereby saving time. Consequently, it would 
have been obvious to one of ordinary skill in the art to include a precharge contmiand in 
the instruction set, as taught by Farmwald. 

(v) Inagami in view of Liik has not explicitly taught a command to deactivate said row 
pointed to by said row address register after it had been precharged by the command to 
precharge. However, it is known that any load that occurs in a DRAM is a destructive 
load. That is, reading a value in DRAM causes the charges making up that value to 
diminish. Consequently, after a value is read, the DRAM row must be refreshed. 
Farmwald has also taught a command which performs an explicit refresh. See Fig.4 and 
column 12, lines 58-60. As is known in the art, a refi'esh deactivates the DRAM because, 
while it is being refi-eshed, no read/write requests can be serviced. See Garcia, column 1, 
line 66, to column 2, line 1 for support for this concept. As a result, in order to ensure 
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that the DRAM doesn't lose its contents after a read, it would have been obvious to one 
of ordinary skill in the art at the time of the invention to modify Inagami to include a 
command to perform a refresh (i.e., a deactivate) of a particular row after it had been 
precharged (and read). 

8. Referring to claim 6, Inagami in view of Luk and further in view of Farmwald has taught 
a processor as described in claim 1 . Inagami has further taught a plurality of DRAM arrays. 
More specifically, the smallest array possible is either 1 row or 1 column. Clearly, Inagami has 
more rows and columns than this; otherwise, very little data would be stored. Consequently, 
Inagami would have many DRAM arrays (for instance, may 1x1 memory arrays). 

9. Referring to claim 7, Inagami in view of Luk and further in view of Farmwald has taught 
a processor as described in claim 1 . Inagami has fiirther taught: 

a) at least one fiinctional unit. See Fig.l, component 5. 

b) whereby said one or more sets of registers comprise a plurality of register files (see Fig.l, 
components Vro-VR7 and VMR0-VMR7), each of said register files comprising a parallel 
access port operative to load or store contents of said register file in a single cycle from or to a 
DRAM row as selected by said row-address register (again see Fig.l), each of said register files 

further comprising at least a second access port operative to transfer data between said functional 
unit and a selected subset register in said register file (note that the register files are coupled to 
the functional units 5 via switches). 

10. Referring to claim 8, Inagami in view of Luk and fiirther in view of Farmwald has taught 
a processor as described in claim 7. Inagami has fiirther taught: 

a) a second fimctional unit. See Fig.l, component 2, for instance. 
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b) whereby said first functional unit executes a first command to perform logical processing on 
the contents of one or more registers within a selected active one of said register sets (as seen in 
Fig.l, component 5 performs operations on register operands), and said second functional unit 
executes a second command to parallely transfer data between a selected inactive one of said 
register sets and said DRAM array (note that the load/store pipes do the loading and that a 
register file may be loaded while not being used to supply operands, i.e., inactive). 

1 1 . Referring to claim 9, Inagami in view of Luk and fiirther in view of Farmwald has taught 
a processor as described in claim 8. Inagami has not exphcitly taught that said first and second 
functional units execute said first and second commands substantially contemporaneously. 
However, it should be noted that the load/store pipes and the operation pipes are completely 
separate (see Fig. 1). That is, there is not a single unit which executes both instructions. 
Consequently, both the first and second commands could execute simultaneously and this would 
result in the most efficiency as two operations may be performed in less time than performing 
one after the other. As a result, it would have been obvious to execute the first and second 
commands simultaneously because Inagami 's hardware supports it. 

12. Referring to claim 10, Inagami in view of Luk and fiirther in view of Farmwald has 
taught a processor as described in claim 8. Inagami has further taught: 

a) a first software module comprising a set of data manipulation commands, said first software 
module executed by said first functional unit. It is inherent that a group of instructions exist 
which causes the manipulation of register data. These commands, which include multiplication, 
addition, etc. (column 4, lines 35-37), would be executed by the operation pipes 5 (Fig.l). 
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b) a second software module comprising a set of parallel data transfer commands, said second 
software module being executed by said second fimctional unit. See column 1, lines 52-64, and 
note that these transfer commands are executed by the second fimctional unit (load/store pipes). 
See column 4, lines 32-35. 

c) whereby said second software module operates in support of said first software module to 
prefetch data from said DRAM array into one of said register files in advance of said data being 
needed by said first software module. Clearly, when load instructions are executed, they are 
executed to bring data into the memory so that subsequent instructions may use that data. This is 
prefetching in that the data is prefetched before the consumer instruction actually requires it. 

1 3 . Referring to claim 12, Inagami in view of Luk and further in view of Farmwald has 
taught a processor as described in claim 1 . Inagami in view of Luk and fiirther in view of 
Farmwald has fiirther taught: 

a) first and second sets of functional units (Fig.l, components 5 and 2 respectively), said first and 
second sets of functional mits having respective first and second instruction subsets. Clearly, 
the first fimctional unit will execute instructions such as multiplication and addition 
(manipulation instructions) while the second fimctional unit executes instructions for data 

transfer. See Inagami, column 4, lines 32-37. 

b) whereby the second instruction subset includes said commands (ii), (iii), (iv) and (v). Since 
commands (ii)-(v) deal with memory, they will be executed by component 2. 

c) Inagami has not explicitly taught that command (i) is part of the first instruction subset. 
However, the row address register holds memory addresses and Official Notice is taken that 
memory addresses may be generated by arithmetic units. This would allow for different types of 
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addressing modes to exist, such as indirect addressing. As a result, it would have been obvious 
to one of ordinary skill in the art at the time of the invention to modify Inagami such that 
command (i) is part of the first instruction subset. 
14. Referring to claim 13, Inagami has taught a processor comprising: 

a) an array comprising a plurality of random access memory cells arranged in rows and columns. 
See Fig.l and note that main memory (component 1) inherently comprises rows and columns of 
memory cells. Inagami has not explicitly taught that the memory array is an embedded DRAM 
array (embedded on the same chip as the processor). However, Official Notice is taken that 
DRAM and its advantages are well known and accepted in the art. More specifically, DRAM is 
a very popular memory technology because of its high density (due to structural simplicity) and 
low price (in comparison to other memory such as SRAM). Furthermore, Luk has shown that a 
processor and DRAM can be embedded on a single chip. See Fig.2. Luk has further taught, in 
column 2, lines 1-14, that such an integration allows for the elimination of off-chip drivers and 
heavy capacitive loads, low power dissipation, and less communication time between the two 
components. Consequently, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to first modify Inagami 's main storage to be DRAM, for the advantages 
described above, and to then embed the DRAM array on the same chip as the processor. 

b) a row address register that holds a pointer that points to a row of the DRAM array. See Fig. 1 , 
component 60, and column 5, lines 13-17. 

c) one or more sets of data registers, each of said sets of data registers capable of loading or 
storing an entire row of the DRAM array in response to a single latch signal. See Fig. 1 , Fig.4, 
Fig.5, and column 1, lines 52-64, and column 6, lines 48-53. Note that each 4-register set is 
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capable of loading/storing an entire row of the DRAM array in response to a single load 
instruction (latch signal). Note that a single DRAM array row may correspond to a single entry 
shown in storage 21 in Fig.4. For instance, data aO is in row 0, data al is in row 1, etc. Or, in an 
alternate interpretation, a single DRAM row may correspond to four consecutive entries shown 
in storage 21 in Fig.4. A row is merely a number of items arranged in a line. Hence, any 
number of data items in storage 21 can be called a "row". 

d) a bit mask to select one or more data locations within at least one of said register sets. See 
Fig.4 and Fig.5, component 22. 

e) an instruction set which comprises at least: 

(i) a command to perform arithmetic operations on said row address register. See Fig. 6, 
column 7, line 66, to column 8, line 9, and note that arithmetic is performed on the row 
address registers by the start address calculation unit 310. 

(ii) a command to load a set of selected elements of the row pointed to by said row 
address register into a selected set of said data registers, said selection of elements based 
on bits in said mask, wherein all the selected elements of the precharged row are loaded 
into the selected set of data registers in a single operation. Again, see Fig.4, the abstract, 
column 1, lines 52-59, and column 6, line 37, to column 7, line 8. Note that in a first 
interpretation, aO-a3 columns are loaded into the selected register set in response a single 
load instruction. 

(iii) Inagami in view of Luk has not explicitly taught a command to precharge (activate) 
the row pointed to by said row address register. However, it is known that precharging 
must occur. Farmwald has taught an instruction which explicitly precharges a row. See 
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column 7, lines 49-55, and note that a bus transaction instruction is issued. It includes the 
fields shown in Fig.4. When bit 2 of AccessType is set to the appropriate value or when 
the 3'^'' bit of AccessType is set to the appropriate value, a row precharge will occur. See 
column 12, lines 34-53, and column 11, line 25-60. Note that such a command allows for 
precharging ahead of time so that when an actual load occurs, for instance, the load can 
begin immediately, thereby saving time. Consequently, it would have been obvious to 
one of ordinary skill in the art to include a precharge command in the instruction set, as 
taught by Farmwald. 

f) wherein the command to precharge is executed to precharge the row prior to the command to 
load so that at the time the command to load is issued, the command to load can execute without 
the need to wait for the row to precharge. See column 11, lines 25-34, of Farmwald. 

1 5 . Referring to claim 14, Inagami in view of Luk and further in view of Farmwald has 
taught a processor as described in claim 13. Inagami has further taught that said load command 
causes an entire row that was previously precharged to be loaded. Note from Fig.4 that if the 
mask were set to all 1 's, then the entire row would be loaded. 

16. Referring to claim 5 1 , Inagami in view of Luk and fiirther in view of Farmwald has 

taught a processor as described in claim 1 . Inagami has fiirther taught that each of said sets of 
registers is capable of being loaded or stored in response to a single latch signal without a 
caching system that employs cache hits and cache misses. Se Fig.l and note that a cache is not 
disclosed anywhere in Inagami, and therefore, everything performed in Inagami is performed 
without a cache employing hits and misses. 
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17. Claims 23-25 are rejected under 35 U.S.C. 103(a) as being unpatentable over Inagami in 
view of Luk. 

18. Referring to claim 23, Inagami has taught a processor comprising: 

a) an array comprising a plurality of random access memory cells arranged in rows and columns. 
See Fig.l and note that main memory (component 1) inherently comprises rows and columns of 
memory cells. Inagami has not explicitly taught that the memory array is an embedded DRAM 
array (embedded on the same chip as the processor). However, Official Notice is taken that 
DRAM and its advantages are well known and accepted in the art. More specifically, DRAM is 
a very popular memory technology because of its high density (due to structural simplicity) and 
low price (in comparison to other memory such as SRAM). Furthermore, Luk has shown that a 
processor and DRAM can be embedded on a single chip. See Fig.2. Luk has fiirther taught, in 
column 2, lines 1-14, that such an integration allows for the elimination of off-chip drivers and 
heavy capacitivc loads, low power dissipation, and less communication time between the two 
components. Consequently, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to first modify Inagami 's main storage to be DRAM, for the advantages 
described above, and to then embed the DRAM array on the same chip as the processor. 

b) a row address register that holds a pointer that points to a row of the DRAM array. See Fig. 1 , 
component 60, and column 5, lines 13-17. 

c) first and second register files, each of said register files capable of loading or storing an entire 
row of the DRAM array in response to a single latch signal. See Fig.l, Fig.4, Fig.5, and column 
1, lines 52-64, and column 6, lines 48-53. Note that each 4-register set is capable of 
loading/storing an entire row of the DRAM array in response to a single load instruction (latch 
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signal). Note that a single DRAM array row may correspond to a single entry shown in storage 
21 in Fig.4. For instance, data aO is in row 0, data al is in row 1, etc. Or, in an alternate 
interpretation, a single DRAM row may correspond to four consecutive entries shown in storage 
21 in Fig.4. A row is merely a number of items arranged in a line. Hence, any number of data 

items in storage 21 can be called a "row". 

d) an embedded processor comprising first and second fiinctional units, said first and second 
functional units having respective first and second instruction sets and capable of accessing said 
first and second register files. See Fig.l and note that the first set of fiinctional units could be 
component 5, which executes manipulation instructions like multiplication, addition, etc. See 
column 4, lines 35-37. The second set of functional units could be component 2 of Fig.l, which 
executes data transfer instructions such as loads and stores. See column 4, lines 32-35. 

e) wherein said first and second register files comprise a parallel access port (see connections 
between register files and switches 120-123) operative to parallely transfer contents of one of 
said register files between a DRAM row as selected by said row-address register (again see Fig.l 
and note that these connections allow register files to be loaded with DRAM contents), said first 
and second register files fiirther comprising at least a second access port operative to transfer 
data between a selected one of said register files and said second functional unit (see the 
connections in Fig.l between the switches 120-123 and the load/store pipes (second functional 
unit. . .these allow data to be read from the register files by the pipes and ultimately stored in 
DRAM). 

f) wherein said first instruction set comprises at least: 
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(i) a command to manipulate data in a data register with a register file. See column 4, 
lines 35-37. 

g) wherein said second instruction set comprises at least: 

(i) a command to perform an arithmetic operation on said row address register. See 
Fig. 6, column 7, line 66, to column 8, line 9, and note that arithmetic is performed on the 
row address registers by the start address calculation unit 310. 

(ii) a command to load the entire row pointed to by said row address register into a 
selected set of registers of said register files in a single operation. See Fig.4, and column 
1, lines 52-59. Note that each row of the memory shown in Fig.4 which corresponds to a 
mask bit equal to 1 is transferred to a register file. For instance, aO, which is clearly the 
first row in the main storage, is transferred to a register file. The data aO is the entire row 
data. 

19. Referring to claim 24, Inagami in view of Luk has taught a processor as described in 
claim 23. Inagami has not explicitly taught that said first and second functional units each 
respectively execute a contmiand from said first and second instruction sets substantially 
contemporaneously. However, it should be noted that the load/store pipes 2 and the operation 
pipes 5 are completely separate (see Fig. 1). That is, there is not a single unit which executes 
both instructions. Consequently, both the first and second commands (for instance, an add and a 
load) could execute simultaneously and this would result in the most efficiency as two operations 
may be performed in less time than performing one after the other. As a result, it would have 
been obvious to execute the first and second commands simultaneously because Inagami 's 
hardware supports it. 
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20. Referring to claim 25, Inagami in view of Luk has taught a processor as described in 
claim 24. Inagami has further taught: 

a) a first software module comprising data manipulation commands drawn from said first 
instruction set, said first software module executed by said first functional unit. It is inherent that 

a group of instructions exist which causes the manipulation of register data. These commands, 
which include multiplication, addition, etc. (column 4, lines 35-37), would be executed by the 
operation pipes 5 (Fig.l). 

b) a second software module comprising a parallel data transfer command drawn from said 
second instruction set, said second software module being executed by said second functional 
unit. See column 1, lines 52-64, and note that these transfer commands are executed by the 
second functional unit (load/store pipes). See column 4, lines 32-35. 

c) whereby said second software module operates in support of said first software module to 
prefetch data from said DRAM array into one of said register files in advance of said data being 
needed by said first software module. Clearly, when load instructions are executed, they are 
executed to bring data into the memory so that subsequent instructions may use that data. This is 
prefetching in that the data is prefetched before the consumer instruction actually requires it. 

21 . Claims 2-5 and 1 1 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Inagami in view of Luk in view of Farmwald and fiirther in view of Parady, U.S. Patent No. 
5,933,627. 

22. Referring to claim 2, Inagami in view of Luk in view of Farmwald has taught a processor 
as described in claim 1 . 
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a) Inagami has further taught first and second sets of functional units, said first and second sets 
of functional units having respective first and second instruction sets and capable of accessing 
first and second sets of said register. See Fig.l, components 5-0 to 5-3, and note that these 
functional units may access any of the register sets. 

b) Inagami has not taught a command to select one of said first and second sets of registers to be 
an architectural set of registers accessible to said first set of functional units, a command to 
deselect the other of said first and second sets of registers so that it is no longer an architectural 
register set accessible to said first set of functional units, a command to select one of said first 
and second sets of registers to be an architectural set of registers accessible to said second set of 
functional units, and a command to deselect the other one of said first and second sets of 
registers so that it is no longer an architectural register set accessible to said second set of 
functional units. However, Parady has taught a thread switch conmiand which performs such 
operations. More specifically, in Parady, a first set of functional units may comprise components 
38 and 40 in Fig.l (which would execute a first set of instructions comprising floating-point 
addition, subtraction, and multiplication operations). In response to a thread switch conmiand, a 
new register set (Fig.l, component 50) corresponding to the switched-in (current) thread will be 
made accessible to the first set of functional units. Meanwhile, the previously used register set 
corresponding to the switch-out (inactive) thread is no longer accessed by the first set of 
functional units. See the abstract and column 2, lines 18-39. Likewise, the thread switch 
command will also make the new register set available to a second set of functional units (for 
example, components 42 and 44 in Fig.l, which would execute a second set of instructions 
comprising floating-point division and graphical addition and subtraction operations), whereas 
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the previously used register set will become "invisible" to the second set of functional units. A 
person of ordinary skill in the art would have recognized that multithreading and thread 
switching are beneficial tools for a system because when one thread stalls, another thread may be 
switched in and executed, thereby preventing the processor from going idle. With thread 
switching comes the switching of register files as well. Therefore, it would have been obvious to 
one of ordinary skill in the art at the time of the invention to modify Inagami to toggle between 
active and inactive register file states because switching of register sets supports thread 
switching, which as described above, prevents a processor from staying idle during a stall, 
thereby increasing throughput and efficiency. 

23 . Referring to claim 3, Inagami in view of Luk in view of Farmwald has taught a processor 
as described in claim 1 . 

a) Inagami has fiirther taught first and second sets of fiinctional units, said first and second sets 
of functional units having respective first and second instruction sets and capable of accessing 
first and second sets of said registers. See Fig.l, components 5-0 to 5-3, and note that these 
fimctional units may access any of the register sets. 

b) Inagami has not taught a command which selects one of said first and second sets of registers 
to be an architectural set of registers accessible to said first set of functional units, and at the 
same time, which deselects the other one of said one of said first and second sets of registers to 
be an architectural set of registers accessible to said second set of functional units. However, 
Parady has taught a thread switch command which performs such operations. More specifically, 
in Parady, a first set of fimctional units may comprise components 34 and 36 in Fig.l (which 
would execute a first set of instructions comprising integer ALU, multiplication, and division 
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operations). In response to a thread switch command, a new register set (Fig.l, component 48) 
corresponding to the switched-in (current) thread will be made accessible to the first set of 
functional units (integer registers for integer fimctional units). At the same time, the new register 
set will not be accessible to a second set of fimctional unit comprising components 38 and 40 in 
Fig.l (which would execute floating-point addition, subtraction, and multiplication operations). 
This is because the selected register file is an integer register file and floating-point functional 
units would not access the integer register file (they would access a fioating-point register file 
50). A person of ordinary skill in the art would have recognized that multithreading and thread 
switching are beneficial tools for a system because when one thread stalls, another thread may be 
switched in and executed, thereby preventing the processor fi-om going idle. With thread 
switching comes the switching of register files as well. Therefore, it would have been obvious to 
one of ordinary skill in the art at the time of the invention to modify Inagami to toggle between 
active and inaccessible register file states because switching of register sets supports thread 
switching, which as described above, prevents a processor from staying idle during a stall, 
thereby increasing throughput and efficiency. 

24. Referring to claim 4, Inagami in view of Luk in view of Farmwald has taught a processor 
as described in claim 1. Inagami has not explicitly taught the specifics of the functional units 5-0 
to 5-3, and more specifically has not taught first and second sets of functional units, said first and 
second sets of functional units having respective first and second instruction sets and accessing 
first and second sets of said registers and whereby said first and second instruction sets are 
subsets of said instruction set of said embedded-DRAM processor. However, Parady has taught 
the functional units may be of a floating-point type and of an integer type. See Fig. 1 . These two 
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sets of functional units would execute integer and floating-point instructions, respectively, 
wherein each is a subset of the overall instruction set (integer + floating-point + miscellaneous 
instructions). Having different types of fimctional unit allows for the execution of different types 
of instructions and consequently, it would have been obvious to one of ordinary skill in the art at 
the time of the invention to modify Inagami to include different types of functional units. 

25 . Referring to claim 5, Inagami in view of Luk in view of Farmwald and further in view of 
Parady has taught a processor as described in claim 4. Inagami has not explicitly taught that said 
second set of fimctional units comprises a fimctional unit that is a multi-issue fimctional unit and 
further comprises a dispatch unit and a plurality of functional units which each execute a 
respective instruction stream as dispatched by said dispatch unit. However, Parady has taught 
such a concept. Note that the second fimctional unit may comprise components 28, 38, 40, 42, 
44, and 46 of Fig. 1 . This includes a dispatch unit, which must inherently exists to dispatch 
instructions (in this case 4 instructions at a time (column 3, lines 14-17)), and a plurality of 
functional units (38-46) which may each execute dispatched instructions. Having multiple 
fimctional units and a multi-issue dispatcher allows multiple instructions to be executed at any 
given time, thereby increasing throughput. As a result, it would have been obvious to one of 
ordinary skill in the art at the time of the invention to modify Inagami to have a multi-issue 
functional unit. 

26 . Referring to claim 1 1 , Inagami in view of Lvik in view of Farmwald has taught a 
processor as described in claim 10. Inagami has fiirther taught that: 

a) said first software module contains an instruction that reference registers within an 
architectural register set visible to said first fimctional unit, whereby said architectural register 
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set corresponds to at least partially to said one of said register files that is in an active state. See 
column 4, lines 35-37, and note addition and multiplication instructions would exist which 
operate on data retrieved from register files (Fig. 1). Clearly, if the functional unit 5 is retrieving 
data from a register file, then that register file is visible and in an active state. 

b) said second software module contains instructions that cause data to be transferred between an 
inactive register set and said DRAM array. As discussed above, a register file does not need to 
be read every cycle. Therefore, it may be inactive with respect to the fimctional units. However, 
since the load pipes are separate from the fimctional units, the data transfer may occur even if the 
file is not being used by a functional unit. 

c) Inagami has not taught that the second software module also executes a command to toggle a 
selected register set between said active and inactive states. However, Parady has taught a thread 
switch contmiand which performs such toggling. More specifically, in Parady, every time a 
thread switch occurs, a new register file becomes active and the previously active register file 
becomes inactive. This allows for thread switching without having to save or reload a context of 
a thread, thereby saving processing time. A person of ordinary skill in the art would have 
recognized that multithreading and thread switching are beneficial tools for a system because 
when one thread stalls, another thread may be switched in and executed, thereby preventing the 
processor from going idle. With thread switching comes the switching of register files as well. 
Therefore, it would have been obvious to one of ordinary skill in the art at the time of the 
invention to modify Inagami to toggle between active and inactive register file states because 
switching of register sets supports thread switching, which as described above, prevents a 
processor from staying idle diiring a stall, thereby increasing throughput and efficiency. 
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27. Claims 26-27 are rejected under 35 U.S.C. 103(a) as being unpatentable over Inagami in 
view of Luk and further in view of Parady. 

28. Referring to claim 26, Inagami in view of Luk has taught a processor as described in 
claim 23. Inagami has not taught that the second instruction set further comprises a command to 
toggle a register set between an active state and an inactive state. However, Parady has taught a 
thread switch command which performs such toggling. More specifically, in Parady, every time 
a thread switch occurs, a new register file becomes active and the previously active register file 
becomes inactive. This allows for thread switching without having to save or reload a context of 
a thread, thereby saving processing time. A person of ordinary skill in the art would have 
recognized that multithreading and thread switching are beneficial tools for a system because 
when one thread stalls, another thread may be switched in and executed, thereby preventing the 
processor from going idle. With thread switching comes the switching of register files as well. 
Therefore, it would have been obvious to one of ordinary skill in the art at the time of the 
invention to modify Inagami to toggle between active and inactive register file states because 
switching of register sets supports thread switching, which as described above, prevents a 
processor from staying idle during a stall, thereby increasing throughput and efficiency. 

29. Referring to claim 27, Inagami in view of Luk and further in view of Parady has taught a 
processor as described in claim 26. Parady has fiirther taught that said toggle command causes 
said first register file to toggle fi-om the inactive state to the active state and also causes the 
second register file to toggle from the active state to the inactive state. Again, as described in the 
rejection of claim 23 above, every time a thread switch occurs, a new register file becomes active 
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and the previously active register file becomes inactive. That is, each thread has its own register 
file (column 2, lines 35-37). Consequently, when a thread switch occurs, the register file 
corresponding to the switched in thread goes fi-om inactive to active, while the register file 
corresponding to the switched out thread goes fi-om active to inactive. 

30. Claims 16-22, 28-32 and 34-43, 45 and 49-50 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over Inagami in view of Luk in view of Parady, and further in view of Bissett 
et al., U.S. Patent No. 5,896,523 (herein referred to as Bissett). 

3 1 . Referring to claim 1 6, Inagami has taught a processor comprising: 

a) an array comprising a plurality of random access memory cells arranged in rows and columns. 
See Fig.l and note that main memory (component 1) inherently comprises rows and columns of 
memory cells. Inagami has not explicitly taught that the memory array is an embedded DRAM 
array (embedded on the same chip as the processor). However, Official Notice is taken that 
DRAM and its advantages are well known and accepted in the art. More specifically, DRAM is 
a very popular memory technology because of its high density (due to structural simplicity) and 
low price (in comparison to other memory such as SRAM). Furthermore, Luk has shown that a 
processor and DRAM can be embedded on a single chip. See Fig. 2. Luk has further taught, in 
column 2, lines 1-14, that such an integration allows for the elimination of off-chip drivers and 
heavy capacitive loads, low power dissipation, and less communication time between the two 
components. Consequently, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to first modify Inagami 's main storage to be DRAM, for the advantages 
described above, and to then embed the DRAM array on the same chip as the processor. 
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b) a row address register that holds a pointer that points to a row of the DRAM array. See Fig. 1 , 
component 60, and column 5, lines 13-17. 

c) first and second register files, each of said register files having a plurality of data registers 
capable of loading or storing an entire row of the DRAM array in response to a single latch 
signal. See Fig.l, Fig.4, Fig. 5, and column 1, lines 52-64, and column 6, lines 48-53. Note that 
each 4-register set is capable of loading/storing an entire row of the DRAM array in response to 
a single load instruction (latch signal). Note that a single DRAM array row may correspond to a 
single entry shown in storage 21 in Fig.4. For instance, data aO is in row 0, data al is in row 1, 
etc. Or, in an alternate interpretation, a single DRAM row may correspond to four consecutive 
entries shown in storage 21 in Fig.4. A row is merely a number of items arranged in a line. 
Hence, any number of data items in storage 21 can be called a "row". 

d) Inagami has not taught that each of said register files is also capable of being placed into an 
active state and an inactive state. However, Parady has taught a thread switch command which 
performs such state toggling. More specifically, in Parady, every time a thread switch occurs, a 
new register file becomes active and the previously active register file becomes inactive. This 
allows for thread switching without having to save or reload a context of a thread, thereby saving 
processing time. A person of ordinary skill in the art would have recognized that multithreading 
and thread switching are beneficial tools for a system because when one thread stalls, another 
thread may be switched in and executed, thereby preventing the processor from going idle. With 
thread switching comes the switching of register files as well. Therefore, it would have been 
obvious to one of ordinary skill in the art at the time of the invention to modify Inagami to toggle 
between active and inactive register file states because switching of register sets supports thread 
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switching, which as described above, prevents a processor from staying idle during a stall, 
thereby increasing throughput and efficiency. 

e) a set of functional units that perform logical operations on data accessed from a set of 
architectural registers, wherein registers placed into the active state appear as architectural 
registers to a set of functional units, and registers in the inactive state are not accessible by the 
functional units. See Fig.l and Fig.3 of Parady. Note that multiple functional units (36, 38, 40, 
etc) would operate on instructions from the current thread, which has an associated active 
register file (Fig.3, files 48-50). Therefore, the active file is available to the fiinctional units 
while the files for the inactive threads are not available to the units. 

f) a bit mask to select one or more locations within at least one of said register files. See Fig.4 
and Fig.5, component 22. 

g) an instruction set which comprises at least: 

(i) a command to perform an arithmetic operation on said row address register. See 
Fig.6, column 7, line 66, to coliunn 8, line 9, and note that arithmetic is performed on the 
row address registers by the start address calculation unit 310. 

(ii) a command to load a set of selected elements of the row pointed to by said row 
address register into a selected set of said data registers, said selection of elements based 
on bits in said mask, wherein all the selected elements of the row are loaded into the 
selected set of data registers in a single operation. Again, see Fig.4, the abstract, column 
1, lines 52-59, and column 6, line 37, to column 7, line 8. Note that in a first 
interpretation, aO-a3 colunms are loaded into the selected register set in response a single 
load instruction. 
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Inagami in view of Parady has not taught that the selected set of said data 
registers to be loaded are in the inactive state. However, Bissett has taught performing 
data prefetches (loads) in the background. And, this is beneficial because background 
operations do not influence software execution, thereby allowing the current program to 
run normally while also accomplishing a task in the background. See column 4, lines 3- 
8. And, as is known in the art, prefetching is beneficial because data is brought in from 
memory before an instruction needs it. Then, when the instruction does actually execute, 
the data is already fetched, allowing the instruction to execute more quickly. 
Consequently, it would have been obvious to one of ordinary skill in the art at the time of 
the invention to modify Inagami in view of Parady in view of Bissett such that Inagami in 
view of Parady performs background prefetching to a register file is not currently in use 
(one that corresponds to an inactive thread). 

32. Referring to claim 17, Inagami in view of Luck in view of Parady and further in view of 
Bissett has taught a processor as described in claim 16. Parady has further taught that the 
instruction set fiirther comprises a contmiand to toggle a register set between said active and 
inactive states. As discussed in Parady' s abstract, a thread switch occurs on a cache miss during 
a load instruction's execution. Therefore, this contmiand is built into a load instruction which is 
part of the instruction set. 

33. Referring to claim 18, Inagami in view of Luck in view of Parady and further in view of 
Bissett has taught a processor as described in claim 17. Parady has fiirther taught that said toggle 
command causes said first register file to toggle from the inactive state to the active state and 
also causes the second register file to toggle from the active state to the inactive state. Again, as 
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described in the rejection of claim 16 above, every time a thread switch occurs, a new register 
file becomes active and the previously active register file becomes inactive. That is, each thread 
has its own register file (column 2, lines 35-37). Consequently, when a thread switch occurs, the 
register file corresponding to the switched in thread goes from inactive to active, while the 
register file corresponding to the switched out thread goes from active to inactive. 

34. Referring to claim 1 9, Inagami in view of Luck in view of Parady and further in view of 
Bissett has taught a processor as described in claim 16. Inagami has fiirther taught that the 
instruction set fiirther comprises a command to manipulate the bits in the bit mask. Although the 
command is not explicitly mentioned, the vector masks are stored in register files VMR0-VMR7 
and they may be written to (modified) as seen in Fig. 1 . 

35 . Referring to claim 20, Inagami in view of Luck in view of Parady and fiirther in view of 
Bissett has taught a processor as described in claim 16. Inagami has fiirther taught: 

a) first and second sets of functional units, said first and second sets of functional units having 
respective first and second sets of instructions and capable of accessing said first and second 
register sets. See Fig. 1 and note that the first set of fimctional units could be component 5, 
which executes manipulation instructions like multiplication, addition, etc. See column 4, lines 
35-37. The second set of functional units could be component 2 of Fig. 1, which executes data 
transfer instructions such as loads and stores. See column 4, lines 32-35. 

b) said instruction set fiirther comprises at least: 

(i) a command to select one of said first and second sets of registers to be an architectural 
set of registers accessible to said first set of fiinctional units. See Fig. 1 . Clearly, if a 



Application/Control Number: 10/074,779 Page 27 

Art Unit: 2183 

functional unit (5-0, for instance) executes an addition instruction, it will select at least 
one of the register files so that it may access operands, 
(ii) a command to select one of said first and second sets of registers to be an 
architectural set of registers accessible to said second set of functional units. See Fig.l. 
Clearly, if a functional unit (2-0, for instance) executes a store instruction, it will select at 
least one of the register files so that it may access an operand to be stored. 
36. Referring to claim 2 1 , Inagami in view of Luck in view of Parady and fiirther in view of 
Bissett has taught a processor as described in claim 20. Inagami has not taught that the 
instruction set further comprises a command to deselect the other of said first and second sets of 
registers so that it is no longer an architectural register set accessible to said first set of functional 
units and a command to deselect the other one of said first and second sets of registers so that it 
is no longer an architectural register set accessible to said second set of functional units. 
However, Parady has taught such a concept. More specifically, in Parady, every time a thread 
switch occurs, a new register file becomes active and the previously active register file becomes 
inactive (deselected so it is no longer accessible). This allows for thread switching without 
having to save or reload a context of a thread, thereby saving processing time. A person of 
ordinary skill in the art would have recognized that multithreading and thread switching are 
beneficial tools for a system because when one thread stalls, another thread may be switched in 
and executed, thereby preventing the processor from going idle. With thread switching comes 
the switching of register files as well. Therefore, it would have been obvious to one of ordinary 
skill in the art at the time of the invention to modify Inagami to toggle between selection and 
deselection of register files because switching of register sets supports thread switching, which as 
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described above, prevents a processor from staying idle during a stall, thereby increasing 
throughput and efficiency. 

37. Referring to claim 22, Inagami in view of Luck in view of Parady and further in view of 
Bissett has taught a processor as described in claim 20. Inagami has further taught that at least 
one of said sets of functional units contains a single fimctional unit. See Fig.l, component 5, and 
note that it contains a single unit 5-0. 

38. Referring to claim 28, Inagami has taught a processor comprising: 

a) an array comprising a plurality of random access memory cells arranged in rows and columns. 
See Fig.l and note that main memory (component 1) inherently comprises rows and columns of 
memory cells. Inagami has not explicitly taught that the memory array is an embedded DRAM 
array (embedded on the same chip as the processor). However, Official Notice is taken that 
DRAM and its advantages are well known and accepted in the art. More specifically, DRAM is 
a very popular memory technology because of its high density (due to structural simplicity) and 
low price (in comparison to other memory such as SRAM). Furthermore, Luk has shown that a 
processor and DRAM can be embedded on a single chip. See Fig.2. Luk has further taught, in 
column 2, lines 1-14, that such an integration allows for the elimination of off-chip drivers and 
heavy capacitive loads, low power dissipation, and less communication time between the two 
components. Consequently, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to first modify Inagami's main storage to be DRAM, for the advantages 
described above, and to then embed the DRAM array on the same chip as the processor. 

b) first and second dual-port register files, each of said register files capable of parallely 
transferring data between an entire row of said DRAM array in a single operation. See Fig. 1 , 
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Fig.4, Fig.5, and column 1, lines 52-64, and column 6, lines 48-53. Note that each 4-register set 
is capable of loading/storing an entire row of the DRAM array in response to a single load 
instruction (latch signal). Note that a single DRAM array row may correspond to a single entry 
shown in storage 21 in Fig.4. For instance, data aO is in row 0, data al is in row 1, etc. Or, in an 
alternate interpretation, a single DRAM row may correspond to four consecutive entries shown 
in storage 21 in Fig.4. A row is merely a number of items arranged in a line. Hence, any 
number of data items in storage 21 can be called a "row". Also, it should be noted that these 
files are at least dual-port in that they have ports for reading and writing (see Fig. 1) 
c) Inagami has not taught that each of said register files is also capable of being placed into an 
active state and an inactive state. However, Parady has taught a thread switch command which 
performs such state toggling. More specifically, in Parady, every time a thread switch occurs, a 
new register file becomes active and the previously active register file becomes inactive. This 
allows for thread switching without having to save or reload a context of a thread, thereby saving 
processing time. A person of ordinary skill in the art would have recognized that multithreading 
and thread switching are beneficial tools for a system because when one thread stalls, another 
thread may be switched in and executed, thereby preventing the processor fi-om going idle. With 
thread switching comes the switching of register files as well. Therefore, it would have been 
obvious to one of ordinary skill in the art at the time of the invention to modify Inagami to toggle 
between active and inactive register file states because switching of register sets supports thread 
switching, which as described above, prevents a processor from staying idle during a stall, 
thereby increasing throughput and efficiency. 



Application/Control Number: 10/074,779 Page 30 

Art Unit: 2183 

d) first and second embedded functional units, said first and second functional units having 
respective first and second instruction sets that operate on registers of an architectural register 
set, said architectural register set comprising one of the first and second dual-port register files 
that is currently in the active state. See Fig.l and note that the first set of functional units would 
be component 5, which executes manipulation instructions like multiplication, addition, etc. See 
column 4, lines 35-37. The second set of functional units would be component 2 of Fig.l, which 
executes data transfer instructions such as loads and stores. See column 4, lines 32-35. Also, as 
taught in Parady, only the register files in the active state would be accessible. 

e) whereby said first instruction set comprises at least: 

(i) a command to manipulate data in a data register with a register file. See column 4, 
lines 35-37. 

f) whereby said second instruction set comprises at least: 

(i) a command to place an said selected inactive data register file into said active state, 
wherein when the inactive register file is activated, it becomes an architectural register set 
of said first functional unit. See Parady, and note the thread switch conmiand that from 
the absfract and from Fig.3. More specifically, when a thread switch occurs, the register 
file associated with the "switched-in" thread will become active and consequently used 
by the system during execution of that thread. 

(ii) Inagami has taught a command to unidirectionally transfer data between an entire row 
of said DRAM array and a data register file, wherein the fransfer occurs in a single 
operation. See Fig.4. Neither Inagami nor Parady have taught such a fransfer involves an 
inactive register file. However, Bissett has taught performing data prefetches (loads) in 
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the background. And, this is beneficial because background operations do not influence 
software execution, thereby allowing the current program to run normally while also 
accomplishing a task in the background. See column 4, lines 3-8. And, as is known in 
the art, prefetching is beneficial because data is brought in from memory before an 
instruction needs it. Then, when the instruction does actually execute, the data is already 
fetched, allowing the instruction to execute more quickly. Consequently, it would have 
been obvious to one of ordinary skill in the art at the time of the invention to modify 
Inagami in view of Parady in view of Bissett such that Inagami in view of Parady 
performs background prefetching to a register file is not currently in use (one that 
corresponds to an inactive thread). 

(iii) ***NOTE*** As an alternate interpretation, an inactive register file could be nothing 
more than a file which is not currently being accessed by the functional units. This could 
be the case in Fig. 1, where there are many register files. Some may not be needed by 
functional units 5. These would be inactive and a load instruction may be the instructions 
used to transfer data to the inactive file. In addition, when the functional units do request 
operands from the inactive file (say in response to an add instruction), then the inactive 
file becomes active and accessible to the functional unit. Applicant should amend 
accordingly to avoid both interpretations of Inagami, etc. 
39. Referring to claim 29, Inagami in view of Lvik in view of Parady and further in view of 
Bissett has taught a processor as described in claim 28. Inagami has further taught that said 
conmiand to unidirectionally transfer data causes data to be transferred from a row of the DRAM 
array to said selected inactive data register file. See Fig.4. 
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40. Referring to claim 30, Inagami in view of Luk in view of Parady and further in view of 
Bissett has taught a processor as described in claim 28. Inagami has further taught that said 
command to unidirectionally transfer data causes data to be transferred from said selected 
inactive data register file to a row of the DRAM array. See Fig.5. 

4 1 . Referring to claim 3 1 , Inagami in view of Luk in view of Parady and further in view of 
Bissett has taught a processor as described in claim 28. Parady has further taught that the said 
command to place the selected inactive register file into the active state is a command that also 
causes the remaining register file to toggle from the active state into the inactive state. As 
discussed in Parady's abstract, a thread switch occurs on a cache miss during a load instruction's 
execution. Therefore, this command is built into a load instruction which is part of the 
instruction set. 

42 . Referring to claim 32, Inagami in view of Luk in view of Parady and further in view of 
Bissett has taught a processor as described in claim 28. 

a) Inagami has further taught at least one additional register file. Note that there are at least 16 
register files shown in Fig.l (VMR0-VMR7 and VR0-VR7). 

b) Parady has fiirther taught that said command to place the selected inactive register file into the 

active state is a command that also causes a selected other register file to toggle from the active 
state into the inactive state. As discussed in Parady's abstract, a thread switch occurs on a cache 
miss during a load instruction's execution. Therefore, this command is built into a load 
instruction which is part of the instruction set. 

43 . Referring to claim 34, Inagami in view of Luk in view of Parady and further in view of 
Bissett has taught a processor as described in claim 28. Inagami has further taught: 
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a) at least one bit mask. See Fig.4, component 22. 

b) the second instruction set further comprises a command to move a subset of elements between 
a selected register file and a selected row of said DRAM array, whereby said subset is identified 
by said bit mask. See fig.4 and Fig.5. 

44. Referring to claim 35, Inagami in view of Luk in view of Parady and further in view of 
Bissett has taught a processor as described in claim 28. 

a) Inagami has not explicitly taught a dispatch unit but it is deemed inherent by the examiner that 
such a component exists. That is, instructions must be dispatched somehow to the functional 

units. Consequently, a dispatch unit is required. 

b) Inagami has further taught a plurality of functional units that each execute a respective 
instruction stream as dispatched by said dispatch unit. See functional units 5-0 to 5-3 in Fig.l. 
Each of units 5 may execute mult, add, etc., all of which must be dispatched. 

c) Inagami has not explicitly taught that the first functional unit is a multi-issue functional unit. 
However, Parady has taught such a concept. Note that the functional unit of Parady may 
comprise components 28, 38, 40, 42, 44, and 46 of Fig.l. This includes a dispatch unit, which 
must inherently exists to dispatch instructions (in this case 4 instructions at a time (column 3, 
lines 14-17)), and a plurality of functional units (38-46) which may each execute dispatched 
instructions. Having multiple functional units and a multi-issue dispatcher allows multiple 
instructions to be executed at any given time, thereby increasing throughput. As a result, it 
would have been obvious to one of ordinary skill in the art at the time of the invention to modify 
Inagami to have a multi-issue functional unit. 
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45 . Referring to claim 36, Inagami in view of Luk in view of Parady and further in view of 
Bissett has taught a processor as described in claim 28. Inagami has further taught: 

a) a first software module comprising a set of data manipulation commands drawn fi-om said first 
instruction set, said first software module executed by said first functional unit. It is inherent that 
a group of instructions exist which causes the manipulation of register data. These commands, 
which include multiplication, addition, etc. (column 4, lines 35-37), would be executed by the 
operation pipes 5 (Fig.l). 

b) a second software module comprising a set of parallel data transfer commands drawn fi-om 
said second instruction set, said second software module being executed by said second 
functional unit. See column 1 , lines 52-64, and note that these transfer commands are executed 
by the second functional unit (load/store pipes). See column 4, lines 32-35. 

c) whereby said second software module operates in support of said first software module to 
prefetch data from said DRAM array into one of said register files in advance of said data being 
needed by said first software module. Clearly, when load instructions are executed, they are 
executed to bring data into the memory so that subsequent instructions may use that data. This is 
prefetching in that the data is prefetched before the consumer instruction actually requires it. 

46. Referring to claim 37, Inagami in view of Luk in view of Parady and further in view of 
Bissett has taught a processor as described in claim 28. Inagami has further taught that: 

a) said first software module contains an instruction that references registers within an 
architectural register set visible to said first functional unit, whereby said architectural register 
set corresponds to at least partially to said one of said register files that is in an active state. See 
column 4, lines 35-37, and note addition and multiplication instructions would exist which 
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operate on data retrieved from register files (Fig.l). Clearly, if the fiinctional unit 5 is retrieving 
data fi-om a register file, then that register file is visible and in an active state. 

b) said second software module contains instructions that cause data to be transferred between an 
inactive register set and said DRAM array. As discussed above, a register file does not need to 

be read every cycle. Therefore, it may be inactive with respect to the functional units. However, 
since the load pipes are separate from the functional units, the data transfer may occur even if the 
file is not being used by a fiinctional unit. 

c) Finally, recall that Parady has taught that the second software module also executes a 
command to toggle a selected register set between said active and inactive states. More 
specifically, Parady has taught a thread switch command which performs such toggling. Every 
time a thread switch occurs, a new register file becomes active and the previously active register 
file becomes inactive. 

47. Referring to claim 38, Inagami in view of Luk in view of Parady and further in view of 
Bissett has taught a processor as described in claim 28. Inagami has fiirther taught that each of 
said register files contain a number of words, N, matched to the number of words in a row of said 
DRAM array, and said unidirectional transfer comprises moving said selected row in its entirety 
to said selected register file. See Fig.4 and note that if the vector is all 1 's then all of the rows 
will be moved to all of the registers. That is the register file may accommodate N words from 
DRAM. 

48. Referring to claim 39, Inagami in view of Luk in view of Parady and fiirther in view of 
Bissett has taught a processor as described in claim 28. Inagami has fiirther taught a mask and 
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switch unit interposed between said DRAM array and at least one of said register files. See Fig.4 
andFig.5. 

49. Referring to claim 40, Inagami in view of Luk in view of Parady and further in view of 
Bissett has taught a processor as described in claim 28. Inagami in view of Luk in view of 
Parady and further in view of Bissett has not explicitly taught that said second set of instructions 
comprises a command to cause data to be moved from one register to another within a given one 
of said register files (individual register-to-register move operations). However, Official Notice 
is taken the register-register move operations are well known and expected in the art. These 
operations at the very least allow for the copying of registers. Consequently, it would have been 
obvious to one of ordinary skill in the art at the time of the invention to implement register- 
register move operations. 

50. Referring to claim 4 1 , Inagami in view of Luk in view of Parady and further in view of 
Bissett has taught a processor as described in claim 28. Inagami has further taught that said 
second instruction set is used to implement an intelligent caching scheme, whereby said register 
files act as a cache and said second set of instructions are executed in lieu of a standard cache 
that maintains most recently used data and enforces a set associative or a direct-mapped caching 
policy. It should be noted that caches are not discussed in any way whatsoever in Inagami. 
Consequently, it is determined that Inagami does not employ a cache. And, register files hold 
recent operand data which is to be used by the processor for operations. As a result, the register 
files act as a cache for holding recent operation data. 

5 1 . Referring to claim 42, Inagami in view of Luk in view of Parady and further in view of 
Bissett has taught a processor as described in claim 28. 
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a) Inagami has not explicitly taught an instruction register coupled to receive instructions from 
said instruction set, said instruction register operative to hold an instruction to be executed by a 
data assembly unit. However, the examiner has deemed an instruction register (IR) as being a 
component which inherently exists within a system that executes instructions. That is, when 

instructions are fetched from memory, they must be held in an IR so that they may be decoded 
and executed. This includes instructions to be executed by the data assembly unit (Fig. 1 , 
component 2). 

b) Inagami has not explicitly taught a local program memory, but this is also inherent as 
instructions must be stored in some form of memory if they are to be executed. 

c) Inagami has not taught that said second functional unit corresponds to said data assembly unit, 
and said data assembly unit receives an instruction from said second instruction set that causes a 
separate control thread of instructions to be accessed from said local program memory and 
executed by said data assembly unit. However, Parady has taught this concept. As discussed 
above, when a load is executed (by the data assembly unit) but it misses the cache, a thread 
switch occurs, thereby causing a new thread of instructions to be accessed and executed. 

52. Referring to claim 43, Inagami in view of Luk in view of Parady and fiirther in view of 
Bissett has taught a processor as described in claim 42. 

a) Inagami has not explicitly taught a prefetch unit that prefetches instructions from the first and 
second instruction sets from a single very long instruction word (VLIW) instruction memory. 
However, Official Notice is taken that both prefetching and VLIW instructions are well known 
concepts in the art. Prefetching allows instructions/data to be fetched before they are actually 
needed. Therefore, when they are needed, they will be available immediately, as opposed to 
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fetching them at that time. This will increase efficiency. Consequently, it would have been 
obvious to one of ordinary skill in the art at the time of the invention to modify Inagami to 
include a prefetch unit. Also, VLIW instructions allow for multiple instructions to be executed 
at the same time and since they are bundled together during compilation, the need for dynamic 
scheduling is reduced. As a result, it would have been obvious to one of ordinary skill in the art 
at the time of the invention to modify Inagami to implement VLIW instructions, 
b) Inagami has not explicitly taught a dispatch unit that dispatches instructions from the first 
instruction set to the fiinctional units and dispatches instructions from the second instruction 
stream to the data assembly unit. However, such a component is deemed as inherently existing 
by the examiner. That is, instructions must be dispatched somehow to the functional units. 
Consequently, a dispatch unit is required. 

53. Referring to claim 45, Inagami has taught a processor comprising: 
a) an array comprising a plurality of random access memory cells arranged in rows and columns. 
See Fig.l and note that main memory (component 1) inherently comprises rows and columns of 
memory cells. Inagami has not explicitly taught that the memory array is an embedded DRAM 
array (embedded on the same chip as the processor). However, Official Notice is taken that 
DRAM and its advantages are well known and accepted in the art. More specifically, DRAM is 
a very popular memory technology because of its high density (due to structural simplicity) and 
low price (in comparison to other memory such as SRAM). Furthermore, Luk has shown that a 
processor and DRAM can be embedded on a single chip. See Fig.2. Luk has further taught, in 
column 2, lines 1-14, that such an integration allows for the elimination of off-chip drivers and 
heavy capacitive loads, low power dissipation, and less communication time between the two 
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components. Consequently, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to first modify Inagami's main storage to be DRAM, for the advantages 
described above, and to then embed the DRAM array on the same chip as the processor. 

b) first and second dual-port register files (see Fig.l, components VMR0-VMR7 and VR0-VR7), 
whereby the first port of each of said register files is a parallel access port and is parallely 
coupled to said DRAM array (see Fig.l, note that if transfers occur between DRAM and 
registers, as shown in Fig.4 and Fig.5, then a port must couple registers to DRAM). 

c) Inagami has not taught that each of said register files is capable of being placed into an active 
state and an inactive state. However, Parady has taught a thread switch command which 
performs such state toggling. More specifically, in Parady, every time a thread switch occurs, a 
new register file becomes active and the previously active register file becomes inactive. This 
allows for thread switching without having to save or reload a context of a thread, thereby saving 
processing time. A person of ordinary skill in the art would have recognized that multithreading 
and thread switching are beneficial tools for a system because when one thread stalls, another 
thread may be switched in and executed, thereby preventing the processor firom going idle. With 
thread switching comes the switching of register files as well. Therefore, it would have been 
obvious to one of ordinary skill in the art at the time of the invention to modify Inagami to toggle 
between active and inactive register file states because switching of register sets supports thread 
switching, which as described above, prevents a processor from staying idle during a stall, 
thereby increasing throughput and efficiency. 

d) at least one functional unit that executes a first program, said functional unit coupled to said 
second port of said register files, said fimctional imit responsive to commands exclusively 
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involving architectural register operands that map onto to the registers within a register file that 
is in the active state. See Fig. 1 and note that the first set of fiinctional units would be component 
5, which executes manipulation instructions like multiplication, addition, etc. In order to do this, 
it must be coupled to registers, as shown in Fig.l, for retrieving an operating upon operands, 
e) a data assembly unit responsive to an instruction set comprising at least: 

(i) a command that causes data to be moved between the DRAM array and a register file 
that is in the inactive state. See Parady, and note the thread switch command that from 
the abstract and from Fig.3. More specifically, when a thread switch occurs, the register 
file associated with the "switched-in" thread will become active and consequently used 
by the system during execution of that thread. At the same time, the register file 
associated with the "switched-out" thread will become inactive. 

(ii) Inagami has taught a contmiand to transfer data between the DRAM array and a data 
register file. See Fig. 4. Neither Inagami nor Parady have taught such a transfer involves 
an inactive register file. However, Bissett has taught performing data prefetches (loads) 
in the background. And, this is beneficial because background operations do not 
influence software execution, thereby allowing the current program to run normally while 
also accomplishing a task in the background. See column 4, lines 3-8. And, as is known 
in the art, prefetching is beneficial because data is brought in from memory before an 
instruction needs it. Then, when the instruction does actually execute, the data is already 
fetched, allowing the instruction to execute more quickly. Consequently, it would have 
been obvious to one of ordinary skill in the art at the time of the invention to modify 
Inagami in view of Parady in view of Bissett such that Inagami in view of Parady 
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performs background prefetching to a register file is not currently in use (one that 
corresponds to an inactive thread). 

(iii) ***]sjoTE*** As an alternate interpretation, an inactive register file could be nothing 
more than a file which is not currently being accessed by the functional units. This could 
be the case in Fig. 1 , where there are many register files. Some may not be needed by 
functional units 5. These would be inactive and a load instruction may be the instructions 
used to transfer data to the inactive file. In addition, when the functional units do request 
operands from the inactive file (say in response to an add instruction), then the inactive 
file becomes active and accessible to the functional unit. Applicant should amend 
accordingly to avoid both interpretations of Inagami, etc. 
54. Referring to claim 49, Inagami has taught a digital processor comprising: 
a) an array comprising a plurality of random access memory cells arranged in rows and columns. 
See Fig.l and note that main memory (component 1) inherently comprises rows and columns of 
memory cells. Inagami has not explicitly taught that the memory array is an embedded DRAM 
array (embedded on the same chip as the processor). However, Official Notice is taken that 
DRAM and its advantages are well known and accepted in the art. More specifically, DRAM is 
a very popular memory technology because of its high density (due to structural simplicity) and 
low price (in comparison to other memory such as SRAM). Furthermore, Luk has shown that a 
processor and DRAM can be embedded on a single chip. See Fig.2. Luk has further taught, in 
column 2, lines 1-14, that such an integration allows for the elimination of off-chip drivers and 
heavy capacitive loads, low power dissipation, and less communication time between the two 
components. Consequently, it would have been obvious to one of ordinary skill in the art at the 
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time of the invention to first modify Inagami's main storage to be DRAM, for the advantages 
described above, and to then embed the DRAM array on the same chip as the processor. 

b) first and second dual-port register files, each capable of parallel transfer of data between an 
entire row of said embedded DRAM array in a single operation. See Fig.l, Fig.4, Fig.5, and 
column 1, lines 52-64, and column 6, lines 48-53. Note that each 4-register set is capable of 
loading/storing an entire row of the DRAM array in response to a single load instruction (latch 
signal). Note that a single DRAM array row may correspond to a single entry shown in storage 
21 in Fig.4. For instance, data aO is in row 0, data al is in row 1, etc. Or, in an alternate 
interpretation, a single DRAM row may correspond to four consecutive entries shown in storage 
21 in Fig.4. A row is merely a number of items arranged in a line. Hence, any number of data 
items in storage 21 can be called a "row". Also, it should be noted that these files are at least 
dual-port in that they have ports for reading and writing (see Fig. 1). 

c) Inagami has not taught that each of said register files is also capable of being placed into an 
active state and an inactive state. However, Parady has taught a thread switch command which 
performs such state toggling. More specifically, in Parady, every time a thread switch occurs, a 
new register file becomes active and the previously active register file becomes inactive. This 
allows for thread switching without having to save or reload a context of a thread, thereby saving 
processing time. A person of ordinary skill in the art would have recognized that multithreading 
and thread switching are beneficial tools for a system because when one thread stalls, another 
thread may be switched in and executed, thereby preventing the processor from going idle. With 
thread switching comes the switching of register files as well. Therefore, it would have been 
obvious to one of ordinary skill in the art at the time of the invention to modify Inagami to toggle 
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between active and inactive register file states because switching of register sets supports thread 
switching, which as described above, prevents a processor from staying idle during a stall, 
thereby increasing throughput and efficiency. 

d) first and second functional units. See Fig.l, components 5 and 2, respectively. 

e) a method for processing data comprising: 

(i) manipulating data in a data register with a register file using said first functional unit 
that is in an active state. See column 4, lines 35-37. 

(ii) using said second fiinctional unit. See column 4, lines 32-35. 

(iii) placing an inactive register file into said active state, whereby when the register set is 
activated, it becomes an architectural register set of said first fiinctional unit. See Parady, 
and note the thread switch command that fi-om the abstract and fi-om Fig.3. More 
specifically, when a thread switch occurs, the register file associated with the "switched- 
in" thread will become active and consequently used by the system during execution of 
that thread. 

(ii) Inagami has taught unidirectionally transferring data between an entire row of said 
embedded DRAM array and a selected data register file, wherein the transfer of the data 
occurs in a single operation. See Fig.4. Neither Inagami nor Parady have taught such a 
transfer involves an inactive register file. However, Bissett has taught performing data 
prefetches (loads) in the background. And, this is beneficial because background 
operations do not influence software execution, thereby allowing the current program to 
run normally while also accomplishing a task in the background. See column 4, lines 3- 
8. And, as is known in the art, prefetching is beneficial because data is brought in from 
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memory before an instruction needs it. Then, when the instruction does actually execute, 
the data is already fetched, allowing the instruction to execute more quickly. 
Consequently, it would have been obvious to one of ordinary skill in the art at the time of 
the invention to modify Inagami in view of Parady in view of Bissett such that Inagami in 
view of Parady performs background prefetching to a register file is not currently in use 
(one that corresponds to an inactive thread). 

(iii) ***]sjoTE*** As an alternate interpretation, an inactive register file could be nothing 
more than a file which is not currently being accessed by the functional units. This could 
be the case in Fig.l, where there are many register files. Some may not be needed by 
functional units 5. These would be inactive and a load instruction may be the instructions 
used to transfer data to the inactive file. In addition, when the functional units do request 
operands from the inactive file (say in response to an add instruction), then the inactive 
file becomes active and accessible to the functional unit. Applicant should amend 
accordingly to avoid both interpretations of Inagami, etc. 
55. Referring to claim 50, Inagami has taught a processor comprising: 
a) an array comprising a plurality of random access memory cells arranged in rows and columns. 
See Fig.l and note that main memory (component 1) inherently comprises rows and columns of 
memory cells. Inagami has not explicitly taught that the memory array is an embedded DRAM 
array (embedded on the same chip as the processor). However, Official Notice is taken that 
DRAM and its advantages are well known and accepted in the art. More specifically, DRAM is 
a very popular memory technology because of its high density (due to structural simplicity) and 
low price (in comparison to other memory such as SRAM). Furthermore, Luk has shown that a 
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processor and DRAM can be embedded on a single chip. See Fig.2. Luk has further taught, in 
column 2, lines 1-14, that such an integration allows for the elimination of off-chip drivers and 
heavy capacitive loads, low power dissipation, and less communication time between the two 
components. Consequently, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to first modify Inagami's main storage to be DRAM, for the advantages 
described above, and to then embed the DRAM array on the same chip as the processor. 

b) an embedded row address register that holds a pointer that points to a row of the DRAM array. 
See Fig.l, component 60, and column 5, lines 13-17. 

c) embedded first and second register files, each of said register files capable of loading or 
storing an entire row of the DRAM array in response to a single latch signal. See Fig.l, Fig.4, 
Fig.5, and column 1, lines 52-64, and column 6, lines 48-53. Note that each 4-register set is 
capable of loading/storing an entire row of the DRAM array in response to a single load 
instruction (latch signal). Note that a single DRAM array row may correspond to a single entry 
shown in storage 21 in Fig.4. For instance, data aO is in row 0, data al is in row 1, etc. Or, in an 
ahemate interpretation, a single DRAM row may correspond to four consecutive entries shown 
in storage 21 in Fig.4. A row is merely a number of items arranged in a line. Hence, any 
number of data items in storage 2 1 can be called a "row". 

d) Inagami has not taught that each of said register files is capable of being placed into an active 
state and an inactive state. However, Parady has taught a thread switch command which 
performs such state toggling. More specifically, in Parady, every time a thread switch occurs, a 
new register file becomes active and the previously active register file becomes inactive. This 
allows for thread switching without having to save or reload a context of a thread, thereby saving 
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processing time. A person of ordinary skill in the art would have recognized that multithreading 
and thread switching are beneficial tools for a system because when one thread stalls, another 
thread may be switched in and executed, thereby preventing the processor from going idle. With 
thread switching comes the switching of register files as well. Therefore, it would have been 
obvious to one of ordinary skill in the art at the time of the invention to modify Inagami to toggle 
between active and inactive register file states because switching of register sets supports thread 
switching, which as described above, prevents a processor from staying idle during a stall, 
thereby increasing throughput and efficiency. 

d) embedded first and second functional units, said first and second functional units having 
respective first and second instruction sets and capable of accessing said first and second register 
files while in said active state. See Fig.l and note that the first set of fimctional units could be 
component 5, which executes manipulation instructions like multiplication, addition, etc. See 
column 4, lines 35-37. The second set of functional units could be component 2 of Fig.l, which 
executes data transfer instructions such as loads and stores. See column 4, lines 32-35. 

e) wherein said first and second register files comprise a parallel access port (see connections 
between register files and switches 120-123) operative to parallely fransfer contents of one of 
said register files between a DRAM row as selected by said row-address register (again see Fig.l 
and note that these connections allow register files to be loaded with DRAM contents), each said 
register file further comprising at least a second access port operative to transfer data between a 
selected register file and said second fimctional unit (see the connections in Fig. 1 between the 
switches 120-123 and the load/store pipes (second fiinctional unit. . .these allow data to be read 
from the register files by the pipes and ultimately stored in DRAM). 
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f) wherein said first instruction set comprises at least: 

(i) a command to manipulate data in a data register with a register file. See column 4, 
lines 35-37. 

g) wherein said second instruction set comprises at least: 

(i) a command to perform an arithmetic operation on said row address register. See 
Fig.6, column 7, line 66, to column 8, line 9, and note that arithmetic is performed on the 
row address registers by the start address calculation unit 310. 

(ii) a command to load the entire row pointed to by said row address register into a 
selected set of registers of said register files, wherein loading the entire row is performed 
in a single operation. See Fig.4, and column 1, lines 52-59. Note that each row of the 
memory shown in Fig.4 which corresponds to a mask bit equal to 1 is transferred to a 
register file. For instance, aO, which is clearly the first row in the main storage, is 
transferred to a register file. The data aO is the entire row data. Inagami in view of 
Parady has not taught that the selected set of said register files are to be loaded in the 
inactive state. However, Bissett has taught performing data prefetches (loads) in the 
background. And, this is beneficial because background operations do not influence 
software execution, thereby allowing the current program to run normally while also 
accomplishing a task in the background. See column 4, lines 3-8. And, as is known in 
the art, prefetching is beneficial because data is brought in from memory before an 
instruction needs it. Then, when the instruction does actually execute, the data is already 
fetched, allowing the instruction to execute more quickly. Consequently, it would have 
been obvious to one of ordinary skill in the art at the time of the invention to modify 
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Inagami in view of Parady in view of Bissett such that Inagami in view of Parady 
performs background prefetching to a register file is not currently in use (one that 
corresponds to an inactive thread). 

56. Claim 44 is rejected under 35 U.S.C. 103(a) as being unpatentable over Inagami in view 
of Luk in view of Parady in view of Bissett and further in view of Farmwald. 

57. Referring to claim 44, Inagami in view of Luk in view of Parady in view of Bissett has 
taught a processor as described in claim 28. 

a) Inagami in view of Luk in view of Parady and further in view of Bissett has not explicitly 
taught a command to precharge a row of the DRAM array, whereby the second functional unit 
executes a speculative precharging to prevent program delays due to DRAM row precharging. 
However, it is known that precharging must occur in a DRAM. Farmwald has taught an 
instruction which explicitly speculatively precharges a row. See column 7, lines 49-55, and note 
that a bus transaction instruction is issued. It includes the fields shown in Fig.4. When bit 2 of 
AccessType is set to the appropriate value or when the 3^** bit of AccessType is set to the 
appropriate value, a row precharge will occur. See column 12, lines 34-53, and column 11, line 
25-60. Note that such a command allows for precharging ahead of time so that when an actual 
load occurs, for instance, the load can begin immediately, thereby saving time. Consequently, it 
would have been obvious to one of ordinary skill in the art to include a speculative precharge 
command in the instruction set of Inagami, as taught by Farmwald. 
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58. Claim 46 is rejected under 35 U.S.C. 103(a) as being unpatentable over Inagami in view 
of Luk and further in view of BCrishnamohan et al., U.S. Patent No. 5,499,355 (herein referred to 
as Krishnamohan). 

59. Referring to claim 46, Inagami has taught a digital processor comprising: 

a) an array comprising a plurality of random access memory cells arranged in rows and columns. 
See Fig.l and note that main memory (component 1) inherently comprises rows and columns of 
memory cells. Inagami has not explicitly taught that the memory array is an embedded DRAM 
array (embedded on the same chip as the processor). However, Official Notice is taken that 
DRAM and its advantages are well known and accepted in the art. More specifically, DRAM is 
a very popular memory technology because of its high density (due to structural simplicity) and 
low price (in comparison to other memory such as SRAM). Furthermore, Luk has shown that a 
processor and DRAM can be embedded on a single chip. See Fig.2. Luk has further taught, in 
column 2, lines 1-14, that such an integration allows for the elimination of off-chip drivers and 
heavy capacitive loads, low power dissipation, and less communication time between the two 
components. Consequently, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to first modify Inagami 's main storage to be DRAM, for the advantages 
described above, and to then embed the DRAM array on the same chip as the processor. 

b) at least one row address register. See Fig.l, component 60, and column 5, lines 13-17. 

c) one or more sets of registers capable of being loaded or stored. See Fig.l, components 
VMR0-VMR7 and VR0-VR7. Note that these registers may be loaded or stored in response to a 
single load or store signal shown in Fig.4, Fig.5, and column 1, lines 52-64. 

d) a method of processing data comprising: 
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(i) performing an arithmetic operation on said at least one row address register in order to 
manipulate a pointer that points to a row of the embedded DRAM array. See Fig.6, 
column 7, line 66, to column 8, line 9, and note that arithmetic is performed on the row 
address registers by the start address calculation unit 310. 

(ii) Inagami in view of Luk has not explicitly taught speculatively precharging 
(activating) a row pointed to by said at least one row address register based at least 
partially upon historical program execution data indicating a possible need to perform 
one or more load or store operations that would access said row. However, 
Krishnamohan has taught a predictor which predicts that a load will possibly occur at 
some point (based on historical PC values stored in the predictor) and as a result, 
performs a prefetch (i.e., speculative fetch). See Fig.7 and column 7, lines 20-64. This is 
clearly advantageous because, as is known in the art, fetching from main memory 
(DRAM) is time costly. By loading the data into temporary storage before the data needs 
to be actually loaded, at least some of the main memory access time is masked, thereby 
speeding up the system. As a result, it would have been obvious to one of ordinary skill 
in the art at the time of the invention to modify Inagami to speculatively fetch, from 
memory, based at least partially on historical program execution data indicating a 
possible need to perform a load that would access the address speculatively fetched from. 
It follows that since a DRAM row must be precharged in order to fetch from it, it must be 
speculatively precharged prior to speculatively fetching from it. 

(iii) Inagami in view of Luk and fiirther in view of BCrishnamohan has that that in 
response to a separate command executed after the speculatively precharging, loading a 
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plurality of words of a row designated by said at least one row address register into 
designated sets of data registers in a single operation. See Fig.4, and column 1, lines 52- 
59, of Inagami. 

60. Claim 47 is rejected under 35 U.S.C. 103(a) as being unpatentable over Inagami in view 
of Luk in view of Krishnamohan and further in view of Farmwald. In addition, Garcia, U.S. 
Patent No. 4,827,476, is cited as extrinsic evidence which shows that DRAM is inaccessible 
during refresh cycles. 

61 . Referring to claim 47, Inagami in view of Luk and further in view of Krishnamohan has 
taught a method as described in claim 46. Inagami in view of Luk has not explicitly taught 
deactivating rows pointed to by said at least one row address register. However, it is known that 
any load that occurs in a DRAM is a destructive load. That is, reading a value in DRAM causes 
the charges making up that value to diminish. Consequently, after a value is read, the DRAM 
row must be refreshed. Farmwald has taught a command which performs an explicit refresh. 
See Fig.4 and column 12, lines 58-60. As is known in the art, a refresh deactivates the DRAM 
because, while it is being refreshed, no read/write requests can be serviced. See Garcia, column 
1, line 66, to column 2, line 1 for support for this concept. As a result, in order to ensure that the 
DRAM doesn't lose its contents after a read, it would have been obvious to one of ordinary skill 
in the art at the time of the invention to modify Inagami to include Farmwald 's command to 
perform a refi"esh (i.e., a deactivate) of a particular row after it has been precharged (and read). 
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Response to Arguments 

62. Applicant's arguments filed on November 27, 2007, have been fully considered but they 
are not persuasive. 

63. Applicant argues the novelty/rejection of claims 23, 45, and 50 on pages 27-30 of the 
remarks because applicant does not believe that each register file is dual-ported (or has first and 
second ports). 

64. However, as pointed out in the rejection of claims 23 and 50, a first port is that from the 

register files (VRx) to the switches, and the second ports is that from the switch to the load/store 
pipes. Note that it can be said that the register file has these ports because its data appears on 
both of these ports en route to its destination. 

65. Regarding claim 45, applicant merely claims dual ported register files, which elements 
VR0-VR7 are according to Fig. 1 . Looking at Fig. 1 , a first port, at the very least is the port 
coupling the switch 120 to VRO, for instance, and a second port is that coupling register file VRO 
to switch 273, for instance. 

66. Applicant argues the novelty/rejection of claims 1, 13, 16, 23, 28, 46, 49, and 50 on pages 
30-34 of the remarks because appUcant does not believe that the register files are capable of 
loading/storing either an entire row or selected columns of an entire row of a DRAM array either 
in a single operation or in response to a single latch signal, nor does the prior art teach a 
command for doing the same. 
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67. However, See Fig.4, Fig.5, and column 1, lines 52-64, and column 6, lines 48-53. Note 
that each 4-register set is capable of loading/storing an entire row of the DRAM array in 
response to a single load instruction (latch signal). Note that a single DRAM array row may 
correspond to a single entry shown in storage 21 in Fig.4. For instance, data aO is in row 0, data 
al is in row 1, etc. Or, in an alternate interpretation, a single DRAM row may correspond to four 
consecutive entries shown in storage 21 in Fig.4. A row is merely a number of items arranged in 
a line. Hence, any number of data items in storage 21 can be called a "row". 

68. Therefore, the load instruction of Fig.4 is a single operation conmiand that loads an entire 
row into a selected set of registers. The examiner asserts that applicant is reading "row" too 
narrowly, i.e., as the entire set of data in storage 21 shown in Fig.4. That is, applicant appears to 
be reading "row" as values a0-al5. Applicant, when applying such an interpretation, does make 
a valid argument. However, the examiner asserts that "row" can comprise any subset of that 
data. For instance, aO-al form a row of data. 

69. Applicant's arguments with respect to claim 46 on pages 34-36 have been considered but 
are moot in view of the new ground(s) of rejection, which was necessitated by the amendment. 

70. Applicant argues the novelty/rejection of claim 1 on pages 36-40 of the remarks because 
applicant does not beUeve that Inagami includes a command to deactivate a row pointed to by a 
row address register after the row has been precharged. In general, applicant argues that the 
examiner's rejection of claim 1 using a refresh command is invalid because refresh does not 
deactivate DRAM since read/write is still performed. 



Application/Control Number: 10/074,779 Page 54 

Art Unit: 2183 

7 1 . The examiner disagrees because the read/Zwrite being performed is merely for 
maintaining the contents of the DRAM. Said read/write is not a read/write included as part of 
the program code. During a refresh, program code cannot access the memory. Hence, the 
DRAM is deactivated with from the viewpoint of program execution. 

72. Applicant argues the novelty/rejection of claims 16, 28, 45, 49, and 50 on pages 40-43 of 
the remarks because applicant does not believe that the prior art of record has taught that active 
and inactive register files are determined based on whether or not a data assembly unit has 

completed assembly of data within the register file. 

73. In response to apphcant's argument that the references fail to show certain features of 
applicant's invention, it is noted that the features upon which applicant relies (i.e., activity and 
inactivity determined based on completed assembly of data) are not recited in the rejected 
claim(s). Although the claims are interpreted in light of the specification, limitations from the 
specification are not read into the claims. See In re Van Geuns, 988 F.2d 1 181, 26 
USPQ2d 1057 (Fed. Cir. 1993). 

74. The examiner is to give the broadest reasonable interpretation, and this has occurred 
through the use of Parady. 

Conclusion 

75 . Applicant's amendment necessitated the new ground(s) of rejection presented in this 
Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP § 706.07(a). 
Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a). 
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A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1 .136(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the date of this 
final action. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to David J. Huisman whose telephone number is (571) 272-4168. 
The examiner can normally be reached on Monday-Friday (8:00-4:30). 

If attempts to reach the examiner by telephone are unsuccessfiil, the examiner's 
supervisor, Eddie Chan can be reached on (571) 272-41 62. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an apphcation may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 
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Primary Examiner, Art Unit 2183 

April 15, 2008 



